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ABSTRACT 

We present measurements of the quasar two-point correlation function, £q, over the redshift range 
0.3 < z < 2.2 based upon data from the Sloan Digital Sky Survey (SDSS). Using a homogeneous sample 
of 30,239 quasars with spectroscopic redshifts from the Data Release 5 Quasar Catalogue, our study 
represents the largest sample used for this type of investigation to date. With this redshift range and 
an areal coverage of «4,000 deg 2 , we sample over 25 h~ 3 Gpc 3 (comoving) of the Universe in volume, 
assuming the current ACDM cosmology. Over this redshift range, we find that the redshift-space 
correlation function, £(s), is adequately fit by a single power-law, with s = 5.95 ± 0.45 hr 1 Mpc 
and 7 S = 1.16j^Q'}g when fit over 1.0 < s < 25.0 h^ 1 Mpc. We find no evidence for deviation from 
£(s) = at scales of s > 100 Mpc, but do observe redshift-space distortions in the 2-D £,{r p ,ir) 
measurement. Using the projected correlation function, w p (r p ), we calculate the real-space correlation 
length, r = 5.45±g;|| hr 1 Mpc and 7 = 1.90±^3, over scales of 1.0 < r p < 130.0 /i" 1 Mpc. Dividing 
the sample into redshift slices, we find very little, if any, evidence for the evolution of quasar clustering, 
with the redshift-space correlation length staying roughly constant at so ~ 6 — 7 h^ 1 Mpc at z < 2.2 
(and only increasing at redshifts greater than this). We do, however, see tentative evidence for 
evolution in the real-space correlation length, tq, at z > 1.7. Our results are consistent with those from 
the 2QZ survey and previous SDSS quasar measurements using photometric redshifts. Comparing our 
clustering measurements to those reported for X-ray selected AGN at z ~ 0.5 — 1, we find reasonable 
agreement in some cases but significantly lower correlation lengths in others. Assuming a standard 
ACDM cosmology, we find that the linear bias evolves from b ~ 1.4 at z = 0.5 to b ~ 3 at z = 2.2, 
with b(z — 1.27) = 2.06 ± 0.03 for the full sample. We compare our data to analytical models 
and infer that quasars inhabit dark matter haloes of constant mass Mhaio ~ 2 x 1O 12 /i _1 M0 from 
redshifts z ~ 2.5 (the peak of quasar activity) to z ~ 0; therefore the ratio of the halo mass for 
a typical quasar to the mean halo mass at the same epoch drops with decreasing redshift. The 
measured evolution of the clustering amplitude is in reasonable agreement with recent theoretical 
models, although measurements to fainter limits will be needed to distinguish different scenarios for 
quasar feeding and black hole growth. 

Subject headings: clustering - quasars: general - cosmology: observations - large-scale structure of 
Universe, general - surveys 



1. INTRODUCTION 

Understanding how and when the structures in the lo- 
cal Universe formed from the initial conditions present 
in the early Universe is one of the fundamental goals of 
modern observational cosmology. Tracing the evolution 
of clustering with cosmic epoch offers the potential to un- 
derstand the growth of structure and its relation to the 
energy and matter content of the Universe, including the 
relationship between the dark matter and the luminous 
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galaxies and quasars that we observe. 

As such, one of the prim ary science goals o f the Sloan 
Digital Sky Survey (SDSS; I York et al.ll2000[) is to mea- 
sure the large-scale distribution of galaxies and quasars, 
and in particular, to determine t he spatial cl u sterin g of 
quasars as a function of redshift. IShen et ah! (|2007| ) re- 
port on the clustering of high redshift (z > 2.9) quasars 
from the SDSS; in this paper, we investigate the spa- 
tial clustering from redshift z = 2.2 to the present day, 
i.e. the evolution of quasar clustering over nearly 80% 
of the age of the Universe (the gap in redshift being a 
consequence of the optical selection techniques used in 
the SDSS). 

Due to their high intrinsic luminosities, quasars are 
seen to large cosmological distances, and are thus good 
probes of large-scale structure (LSS) and its evolution. 
However, until recently, quasar studies were plagued by 
low-number statistics, leading to shot noise, and samples 
covered only small areas of sky, leading to sample vari- 
ance. With the advent of large solid angle (> 1000 deg 2 ) 
surveys with efficient selection techniques, these limita- 
tions have been overcome, and the number of known 
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quasars has increased by more than an order of magni- 
tude in the last decade, thanks mainly to the 2dF QSO 
Reds hift Survey (2QZ; iBovle et all 120001: ICroom et all 
120041) and the SPSS. T he latest SDSS quasar catalogue 
( Schneide r et al.l l2007h contains nearly 80 000 objects. 
Using the data from these large surveys, we are now in a 
position to make high-precision measurements of quasar 
clustering properties. 

The two-point correlation function (2PCF), £, is a sim- 
ple but powerful statistic commonly employed to quan- 
tify the cluste ring properties of a given class of object 
(|Peebledll980[ ). The observed value of £ for quasars can 
be related to the underlying (dark) matter density dis- 
tribution via 



£(^*)quasar &Q £(^*)mattcr 



(1) 



where £(r) ma tter is the mass correlation function and 6q 
is the linear bias parameter for quasars. Although equa- 
tion (JTJ defines 6q, and there are theoretical arguments 
sug gesting that bo is scale-inde pendent on large scales, 
e.g. IScherrer fc Weinberg! (|1998l ). we do not know a pri- 
ori if this is the case. 

With certain reasonable assumptions, the measure- 
ment and interpretation of the bias can lead to de- 
termination of the dark matter halo properties of 
quasars and to quasar lifetim es (t q , iMartini fe Weinberg) 
120011 : lHaiman fc Huil 12001). In the standard scenario, 
quasar activity is triggered by accretion ont o a cen- 
tral, supermassive black hole ( SMBH , e.g. iSalpeterl 
[1961 iLvnden-Belll fl969t iReesI Il98l . Given the 
possible connection between the SMBH and host 
halo, and the fact that halo properties are cor- 
related with the local density contrast, clustering 
measurements can be used to constrain this poten- 
tial halo-SMBH connection and provide an insight 
into quasar and blac k hole physics (e.g. iBaes et al.l 
2001 iWvithe fc Loebl 120051 IWyithe fc Padmanabhanl 
20061 lAdelberger fc SteideT l2005bt iFine" et al .1 120061 : 
da Angela et alj 120081 ). This information, combined 



with the quasar luminosity function (QLF), constrains 
f], the fraction of the Eddington lumi nosity at which 
quasars shine, and their duty cycle (jWvithe fc Loebl 
l200l iShankar et al.ll200l . 

Early m e asurements of the quasa r 2PCF (e.g . 
Anil 119701: [Hawkins fc Reddish! Il975l : lOsmeri fl98ll : 
Shanks et al.lll983l 119871 ) measured statistically signifi- 
cant clustering on scales of a few h^ 1 Mpc, for both the 
quasar auto-correlation function and cross-correlation 
with galaxies. This result has be en confirmed with 
data from more rece nt surveys, (e.g. ICroom et al.l 120051 : 
iPorciani et"aT1l2004f K The Quasar 2PCF is typically fit 
to a single power law of the form, 



£(r) = {r/r a y 



(2) 



over the range 1 ft -1 Mpc < r < 100 hr 1 Mpc. Here, ro 
is the correlation length quoted in comoving coordinates 
and 7 is the power-law slope. Typical measured corre- 
lation lengths and slopes for quasars at redshift z <~ 1.5 
are r = 5 — 6 h^ 1 Mpc and 7 ~ 1.5, respectively. 

The evolution of the quasar correlation function has 
been disputed for a long time, with some authors report- 
ing that ro eit her decreased or only w e akly evolved with 
redsh ift (e.g. Ilovino fc Shaverl 119881: ICroom fc Shanks! 
1996), while others reported an increase with redshift 



(e.g. lKundidll997HLa Franca et al.lll998f ). However, with 
the advent of the 2QZ Survey, r has been shown to 
evolve at the ~ 90 — 99% confidence level, in the sense 
that quasar clustering increases with redshift, although 
the actual degree of evolution is weak (ICroom et al.l!2001t 
Porciani et ap 12004 ICroom et afll2005h . In particular. 
Croom et al. r pooi used over 20 000 objects from the 
final 2QZ dataset to measure the redshift-space two- 
point correlation function, £(s), over the redshift range 
0.3 < z < 2.2 and found a significant increase in the 
clustering amplitude at high redshift. The quasar bias, 
where the bias depends on the underlying CDM model 
such that a constant ro can imply a strongly varying b, 
was found to be a strong function of redshift, with an 
empirical dependence of 

b Q ( z ) = (0.53 ± 0.19) + (0.289 ± 0.035)(1 + zf . (3) 

These values were used to derive the mean dark matter 
halo (DMH) mass occupied by quasars, giving a redshift- 
independent value of M DM h = (3.0 ± 1.6) x lO^h^Mp, 
Indep endent analysis of the 2QZ data by IPorciani et all 
(|2004f ) confirmed th ese findings . 

Using the SDSS, IShen et al.1 (|2007l ) found that red- 
shift 2.9 < z < 5.4 quasars are significantly more clus- 
tered than their z ~ 1.5 counterparts, having a real- 
space correlation length and power-law slope of ro = 
15.2 ± 2.7 hr 1 Mpc and 7 = 2.0 ± 0.3, respectively, over 
the scales 4 hr 1 Mpc < r p < 150 h^ 1 Mpc (where r p is 
the sepa ration from the p rojected correlation function, 
w p (r p )). IShen et ail (|2007l ) also found that bias increases 
with redshift, with, 6q ~ 8 at z — 3.0 and &q ~ 16 at 
z = 4.5. 

iMvers et all (|2006l l2007f) . also using SDSS data, ex- 
amined the clustering of photometrically-selected quasar 
candidates over ~ 50 /i _1 kpc to ~ 20 hr 1 Mpc scales. In 
this sample, quasar redshifts were assign ed from photo- 
metr ic rather than spectral information (Richards ct al. 
2001). They found that the linear bias, bq, increases 
with redshift, from bq = 1.93 at redshifts 0.4 < z < 1.0 
to bq = 2.84 at 2.1 < z < 2.8, co nsistent with equa- 
ti on @ (Fig. 4 of IMvers et al.l l2007t). 

iPadmanabhan et al.l (|2008f ) measured the clustering of 
photometrically-selected luminous red galaxies (LRGs) 
around a low redshift, 0.2 < z < 0.6, sample of quasars, 
with both LRG and quasar samples coming from the 
SDSS. They determined a large-scale quasar bias 6q = 
1.09±0.15 at a median redshift of z — 0.43. After taking 
into account me asurement and interpretati on subtleties, 
the results from IPadmanabhan et al.l (l2008f). are i n qua l- 
itative agreement with those from ISerber eTai] (|200ll . 
who find that Mi < —22, z < 0.4 quasars are located in 
hig her local galaxy ove rdensities than typical L* galax- 
ies. ISerber eta l. (2006) suggested that quasars typically 
reside in L* galaxies, but have a local excess of neigh- 
bours within ~ 0.15 — 0.7 hr 1 Mpc, which contributes 
to the triggering o f quasar activity thr ough mergers and 
other interactions. IStrand et al.l (f2008f ) using photomet- 
ric redshift cuts, confirm the basic overdensity value s 
measured by ISerber et all (l2006h . iHennawi et at] (l2006h. 
IMvers et all (|2007l) and IMvers et alj (|2008ft reached sim- 
ilar conclusions by examining pairs of quasars on < 
1 hr 1 Mpc scales. The quasar correlation function shows 
a sma ll scale excess over a power law, and IHennawi et al.1 
(2006) suggested that the small-scale excess can be at- 
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tributed to dissipative interaction events that trigger 
quasar activity in rich environments. 

Due to the evolution of the quasar luminosity function 
and the flux-limited nature of most quasar samples, there 
is a strong correlation between redshift and luminosity 
in these samples, making it difficult to isolate luminos- 
ity dependence of clustering from redshift dependence. 
Recently, Ida Angela et al.l (|2008l ) combined data from 
the 2QZ and the 2SLAQ Survey (2dF-SDSS LRG And 
QSO Survey: ICroom et al.| [2008). to investigate quasar 
cluste ring and break this degeneracy. Ida Angela et al.l 
(2008) estimate the mass of the dark matter haloes which 
quas ars inhabit to be ~ 3 x 10 12 /i _1 M Q , in agreement 
with ICroom et all (|2005h . a value that does not evolve 
strongly with redshift or depend on QSO luminosity. 
Their results also suggest that quasars of different lu- 
minosities may contain black holes of similar mass. 

There have also been recent advances in theo- 
retical predictions of the quasar correlation func- 
tion and its evolut i on with redshift (iLidz et al.l 120061: 
Hopkins etai] 120071: IShankar et all 120071 : iHopkins et all 
2008 : i iBasilakos et aLl feoOS) and we discuss these models 
in more detail in Sections 4 and 5. 

In this paper, we shall measure the quasar 2PCF for 
rcdshifts z < 2.2, using the largest sample of spectroscop- 
ically identified quasars to date. We will investigate the 
dependence of quasar clustering strength with redshift 
and luminosity, allowing tests of current quasar forma- 
tion and evolution models. 

This paper is organised as follows. In Section 2 we 
present our data sample, mentioning several effects that 
could give rise to systematics in the measurements. In 
Section 3 we briefly describe the techniques involved in 
measuring the two-point correlation function and in Sec- 
tion 4 we present our results. In Section 5 we compare 
and contrast our evolutionary results with recent obser- 
vational results in the literature, and we conclude in Sec- 
tion 6. Appendix A gives technical details for the SDSS, 
Appendix B describes our error analysis and Appendix 
C carries out a series of sy stematic checks. 

In our companion paper (|Shen et al.ll2009t ). we expand 
our investigations on the clustering of SDSS quasars. Us- 
ing the same data as we examine here, Shen et al. study 
the dependence of quasar clustering on luminosity, virial 
black hole mass, quasar colour and radio loudness. 

We assume the currently preferred flat, "Lambda 
Cold Dark Matter" (ACDM) cos mology where ft h = 
0.042, n m = 0.237 , Q A = 0.763 (ISanchez et al.l 120061 : 
ISpergel et al.l l2007f ) and quote distances in units of 
h^ 1 Mpc to aid in ease of comparisons with previ- 
ous results in the literature. Since we are measuring 
objects with redshifts resulting from the Hubble flow, 
all distances herein are given in comoving coordinates. 
Where a value of Hubble's Constant is assumed e.g. 
for absolute magnitudes, this will be quoted explicitly. 
Our magnitudes are based on the AB zero-point system 
(lOke fc Gunrifl983h . 

2. DATA 

Much care must be taken when constructing a dataset 
that is valid for a statistical analysis. In this section and 
Appendix A we describe the various samples we use to 
investigate potential systematic effects in our clustering 
measurements. Appendix A provides some of the rele- 



vant technical details of the SDSS, discussing the Cata- 
logue Archive Server (CAS) and the SDSS Survey geom- 
etry. 

2.1. The Sloan Digital Sky Survey 

The SPSS uses a de dicated 2.5m wide-field tele- 
scope (|Gunn et al. 2006) to collect light for 30 2kx2k 
CCDs JG unn etai] fl998h over five broad bands - 
ugriz (Fukug ita et al.l I1996T ) - in order to image 
~ 7r steradians of the sky. The imaging data 
are taken on dark p hotometric nights of good see- 
ing dHogg et"aLl 120011) and are calibrated photometri- 
cally (ISmith et all 120021; llvezic et alj 12004 iTucker et all 
l2006t Padmanabh an et al.l l2008h . and astrometrically 



Pier e t al. 200 3]), and object parameters are measured 



Lup ton et al.ll2001l: IStoughton et al.ll2002fh 

Using the imaging data, quasar target candidates 
are selected for spectroscopic observation based on 
their colours, magnitudes and dete ction in the FIRST 
radio survey dBecker et all I1995D . as described by 
iRichards etai] (|2002t ). " Unless stated otherwise, all 



quoted SDSS photo metry has been c orrect ed for Galactic 
extinction following Schleg eTet alj |l998). Here we are 
concerned with onl y those quasars select ed from the main 
quasar selection (jRichards et al.l l2002f ). Low-redshift, 
z < 3, quasar targets are selected based on their loca- 
tion in w^ri-colour space and the high-redshift, z > 3, 
objects in griz-colowc space. Quasar candidates passing 
the ugri-colout selection are selected to a flux limit of 
i = 19.1, but since high-redshift quasars are rare, objects 
lying in regions of colour-space corresponding to quasars 
at z > 3 are targetted to i = 20.2. Furthermore, if an 
unresolved, i < 19.1 SDSS object is matched to within 
2" of a source in the FIRST catalogue, it is included in 
the quasar selection. 

A tiling algorithm then assigns these candidates to 
specific spectrosc opic plates, in order to maximise tar- 
get completeness ()Blanton et al.l 120031 ) . Each 3° diame- 
ter spectroscopic plate holds 640 fibres and quasar can- 
didates are allocated at a density of approximately 18 
fibers deg -2 . No two fibres can be placed closer than 
55", corresponding to ~ 0.7 h^ 1 Mpc at (z) = 1.27, the 
mean redshift of our sample (Fig. [23j) . In the case of 
conflicts because of this 55" constraint, the main quasar 
selection candidates were given targetting priority over 
the MAIN galaxy and LRG s urvey targets (jStrauss et al.l 
l2002t lEisenstein et alj I200H , respectively). Therefore, 
excluding subtle effects due to gravitational le nsing 
(jScrantonet alJ 120051 : iMountrichas fc Shanks! 120071 ). the 
LSS 'footprint' of these foreground galaxies should not af- 
fect our LSS quasar measurements. Some targets, includ- 
ing brown dwarf and hot subdwarf calibration star can- 
didates, were given higher priority than the main quasar 
candidates. However, since the surface density of these 
Galactic objects is very low (<C 1 deg -2 ), this should 
not have any significant impact on our results. We in- 
vestigate the effects of quasar-quasar fibre collisions in 
Appendix IC.61 

2.2. Quasar Samples 

For our analysis, we use the SPSS D ata Release 
Five (PR5; lAdelman-McCarthv et al.l l2007h and select 
quasars from the latest versio n of the quasar catalogue 
(DR5Q; ISchneider et al.ll2007l ). This catalo gue consists 



4 



N. P. Ross et al. 



-30 




0.5 1.0 1.5 2.0 
redshift, z 

Fig. 1— The SDSS DR5 Quasar L - 2 plane for the DR5Q 
(black points) and the UNIFORM sample (red points). The affect 
of the i = 19.1 magnitude limit can clearly be seen. Mi is the i- 
band absolute magnitude at the plotted redshift where we use the 
i-f-correction given by Table 4 of lRichards et al.l [120061 ). 



of spectroscopically identified quasars that have lumi- 
nosities larger than Mj = —22.0 (measured in the rest 
frame) and at least one emission line with FWHM larger 
than 1000 km s _1 . Every object in the DR5Q had its 
spectrum manually inspected. There are 77 429 con- 
firmed quasars over the 5 740 deg 2 spectroscopic DR5 
footprint; the 65 660 DR5Q quasars with redshifts z < 
2.2 will be the parent sample we use in this investigation. 

At z > 2.2 the "ultra-violet excess" (UVX) method 
of selecting quasars begins to fail due to the Lya-forest 
suppressing flux as it moves through the SDSS M-band, 
and q uasars have colours similar to those of F-stars (|Fanl 
119991 ). Thus, for 2.2 < z < 2.9, the completeness of the 
survey is dramatically lowered as is discussed in detail by 
iRichards etai] (|2006l ). A lower redshift limit of z = 0.30 
is chosen to match that of the 2QZ. Therefore, although 
we will present results in the redshift ranges z < 0.30 
and 2.2 < z < 2.9, we will not place strong significance 
on these data. The number of quasars used in this study 
is twice that of t he previous largest quasar survey, the 
2QZ (jBovle et al.l [20001 : iCroom et all l200l and allows 
division of our sample in luminosity and redshift bins 
while retaining statistical power. As shown in Sections 
4 and 5, these new data complement the existing 2QZ 
and 2SLAQ quasar survey results, and together improve 
constraints on theoretical models. 

We construct two subsamples from DR5Q. The first 
is designated as the "PRIMARY" Sample, which will in- 
clude those objects in the DR5Q which were targ etted as 
primary quasar candidates ([Richards et al.|[2002t ). having 
satisfied one, or more, of the TA RGET_QS0 , TARGET_HIZ 
or TARGET _FIRST selections (see IStoughton et al.l I2002L 
Section 4.8, for more details on these flags). The SDSS 
quasar survey was designed to be complete in the pri- 
mary sample, and no attempt was made at completeness 
for the quasars selected by other means. In total there 
are 55 577 quasars in the DR5Q that had their target 



flags set to one (or more) of these primary flags, with 
46 272 quasars satisfying our high redshift limit (Table 

The SDSS quasar selection algorithm was in flux in the 
early part of the survey, and was only finalised after DR1 . 
We define the "UNIFORM" sample to be those primary 
objects selected with this final version. The UNIFORM 
sample is flux limited to i = 19.1 at z < 2.9 and contains 
38 208 objects, dropping to 31 290 when a redshift cut of 
z < 2.2 is applied. We show the distribution of objects 
in the redshift-luminosity plane for the full DR5Q and 
0.30 < z < 2.2 UNIFORM sample in Figure Q] We will 
use both the PRIMARY and the UNIFORM samples in 
what follows, but will find inconsistent results between 
the two samples at scales > 60 h^ 1 Mpc. This is inves- 
tigated further in Appendix C. 

The quasar correlation function is sensitive to a num- 
ber of potential systematic effects, including bad pho- 
tometry and improperly corrected dust reddening. Since 
quasars are selected by their optical colors, we shall per- 
form checks on both our PRIMARY and UNIFORM 
samples in Appendix C to see wh at effect regions with 
poor photometry (as defined by IRichards et all [20061 : 
iShen et al.l [2001 has on our clustering measurements. 

While all selection for the quasar sample is undertaken 
using dereddened colors (Richards et al 2001), if there re- 
main systematic errors in the reddening model they can 
induce excess power into the clustering in a number of dif- 
ferent ways. Appendix C describes how these effects af- 
fect our £ (s) measurements and the interpretations based 
thereon. Briefly, we find that: the UNIFORM sample is 
the most stable sample for our studies; reddening and 
bad fields produce insignificant effects to our measure- 
ments; our results are insensitive to the choice of the 
upper bound of the integral in equation @ (7r m ax, see 
Section 3.2) and the comoving z max and fibre collisions 
are not a concern on the scales we investigate. 

3. TECHNIQUES 

In this section we describe the techniques we shall use 
to calculate the Quasar z < 2.2 2PCF. The interested 
reader is referred to the comp 
(fl980l [19931): iPeacockl (fl999l) 



ehensive texts of 


Peebles! 


IColes & Lucchin 


(20021) 



and lMartfnez & Saari (|2002f ) for full details on the 2PCF. 

3.1. Estimating the 2-Point Quasar Correlation 
Function 

In practice, £ is measured by comparing the actual 
quasar distribution to a catalogue of "random" points, 
which have the same selection function, angular mask 
and radial distribution as the data, but are spatially 
distributed in a "random" manner - i.e. are not clus- 
tered. The construction of this random sample shall be 
described in Section I3~2l 

We use the estimator of lLandv fc Szalavl (|1993j ) to cal- 
culate £, as this has been found to be the most rel i- 
able estimator for 2PCF studies (jKerscher et al.l [2000). 
Comparing our r esults to th ose u sing the es t imato rs of 
iDavis fc Peebles! (DP, Il983h and iHamiltonl ([1992D . we 
find the DP estimator causes systematic errors on large 
scales with too much power at s > 40 hr 1 Mpc, as this 
estimator is less robust to errors in the estimation of 
mean density. The LS estimator is given by, 
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TABLE 1 

The SDSS Spectroscopic Quasar Samples 



Sample 
Description 












Area 
/dcg 2 


Number 
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z max 


^mcd 


DR5Q 












ft: 5740 


77 




0.078 


5.414 


1.538 








2 


< 


2.9 




71 


375 


0.078 


2.900 


1.372 




0.3 


< 


2 


< 


2.9 




69 


692 


0.300 


2.900 


1.400 








2 


< 


2.2 




65 


660 


0.078 


2.200 


1.278 




0.3 


< 


2 


< 


2.2 




63 


977 


0.300 


2.200 


1.306 


PRIMARY 












5713 


55 


577 


0.080 


5.414 


1.543 








Z 


< 


2.9 




50 


062 


0.080 


2.900 


1.326 


55 


0.3 


< 


2 


< 


2.9 




48 


526 


0.300 


2.900 


1.360 








2 


< 


2.2 




46 


272 


0.080 


2.200 


1.234 




0.3 


< 


2 


< 


2.2 




44 


736 


0.300 


2.200 


1.268 


UNIFORM 












4013 


38 


208 


0.084 


5.338 


1.575 








2 


< 


2.9 




33 


699 


0.084 


2.900 


1.319 




0.3 


< 


2 


< 


2.9 




32 


648 


0.300 


2.900 


1.234 


55 






2 


< 


2.2 




31 


290 


0.084 


2.200 


1.354 


55 


0.3 


< 


2 


< 


2.2 




30 


239 


0.300 


2.200 


1.269 



Note. — The SDSS Spectroscopic Quasar Samples used in our analysis, 
with minimu m, maximum an d medi an redshifts. The DR5Q is the catalogue 
presented in ISchneider et al.l ( f2007l) . while the PRIMARY and UNIFORM 
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Here N and iV r( j are the number of data and random 
points in the sample, DD(s) is the number of data- 
data pairs with separation between s and s + As in the 
given catalogue, DR(s) is the number of data-random 
pairs, and RR(s) the number of random-random pairs. 
The angled brackets denote the suitably normalised pair 
counts, since we employ at least twenty times more ran- 
dom points than data in order to reduce Poisson noise. 
We choose our bins to be logarithmically spaced, with 
widths of Alog(s/ h- 1 Mpc) = 0.1. 

The measurement of a quasar redshift will not only 
have a (large) component due to the Hubble expan- 
sion, but also components due to the intrinsic pecu- 
liar velocities and redshift errors associated with the 
individual quasar. The peculiar velocities can been 
seen in the redshift-space correlation function, both at 
small a nd large scales (see S ection 4). However, as 
noted i n Schn eider et al.l (|2007f ) and discussed in detail in 
iShen et all (|2007t Appendix A), quasar redshift determi- 
nation can have uncertainties of a v — 500 — 1450 km s _1 
and hence a z = 0.003 — 0.01, and these redshift errors 
will dominate any determination of the peculiar velocity 
signal. 

The real-space correlation function, £(r), is what would 
be measured in the absence of any redshift-space distor- 
tions. We can measure £(r) by projecting out the effects 
of peculiar velocities and redshift errors along the line of 
sight. 

One can resolve the redshift-space separation, s, be- 
tween two quasars into two components, r p and 7r, where 
r p is the separation between two objects perpendicular 
to the line-of-sight and it is the separation parallel to the 
line-of-sight. Thus, 

* 2 = rl + ^ (6) 



(where r p = a is also found in the literature). The '2- 
D' redshift-space correlation function, £(r p ,7r), can be 
calculated as before, 



(DD(r p ,ir)) - (2DR(r p ,n)) + (RR(r p ,ir)) 



{RR(r p ,n)) 



(7) 
be 



where the bin sizes are now chosen to 
Alog(r p / h- 1 Mpc) = Alog(7r/ hr 1 Mpc) = 0.2. 

Redshift-space distortions affect only the radial com- 
ponent of £(?~p,7r); thus by integrating along the line-of- 
sight direction, 7r, we obtain the projected correlation 
function, 



£(r p ,ir)dir. 



(8) 



In practice we set the upper limit on the integral to be 
TTmax = 10 1 * 8 = 63.1 h^ 1 Mpc and show that although 
varying this limit does cause some difference to the de- 
duced w p (r p ), it does not cause significant changes to 
the 2PCF over the scales of interest for our studies (Ap- 
pendix IC.7P . 

Th e integral in equation [5] can be rewritten in terms of 
£(r) l|Davis fc Peebledll983ft . 

nax r£(r) 



zdr. 



(9) 



If we assume that £(r) is a power law of the form, 
£(r) = (r/ro)~ 7 (which, as we shall find later, is a fair 
assumption), then equation [S] can be integrated analyti- 
cally, such that with 7r max = oo, 

'(§)r(V; 



w p (r p ) 



7 1- 
= r r p 



r(i) 



ri-M( 7 ), (10) 



'0'p 



where T(x) is the Gamma function. 

In linear theory and in the absence of small-scale veloc- 
ities and redshift errors, the redshift-space and real-space 
correlation function can be related via 



m = £(r) 



l + ^(z) + i/3 2 (z) 



(11) 
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Fig. 2. — The SDSS DR5 Quasar N(z). The solid (red) histogram 
shows the quasar redshift distribution for the PRIMARY sample, 
while the dashed (blue) histogram shows the redshift distribution 
for the UNIFORM sample. The thin lines for both PRIMARY and 
UNIFORM do not include the 0.3 < z < 2.2 cuts. As a comparison, 
the full DR5Q sample is given by the dotted (black) histogram. 

where 

m = (12) 

b(z) 

parametrizes the 'flattening' at large scales of the cor- 
relation function due to the infall of matter from un- 
derdense to overdense regions. The value of (3(z) has 
trad itionally bee n measured via fits to observed data 
(e.g. [K aiser 1987; Fisher et al. 1994; Peac ock et aDl200lL 
lHawkins et "aDl2003h iRoss et al.ll2007t iGuzzo et al1l2008| ) . 

3.2. Construction of the Random Catalogue 

As mentioned above, to calculate £ in practice, one 
needs to construct a random catalogue of points that 
mimics the data in every way, bar its clustering sig- 
nal. The angular mask and completeness for the PRI- 
MARY and UNIFORM sample is described in detail in 
Appendix A. 

The radial distribution of the sample is measured from 
the data themselves. Figure [2] shows the N(z) distri- 
bution of the DR5Q quasars from our samples. We fit 
a tenth-order polynomial to both the PRIMARY and 
UNIFORM samples, which we use to generate the ran- 
dom sample redshift distribution. This method has 
proved reliable in previous quasar clustering studies (e.g. 
ICroom et all 120051 : Ida Angela et al.ll2008h . 

3.3. Errors and Covariances 

Recent studies (e.g. IScranton et alJl2002T : IZehavi et al.l 
l2002tlMvers et alj|2006t IRoss et al.ll2007D have employed 
three main methods, Poisson, Field-to- Field and Jack- 
knife to estimate errors in correlation function measure- 
ments. T he 'simpl e st' of these is the Poisson error de- 
scribed bv iPeeblea (|1973l ): this is the Poisson noise due 
to the number of pairs in the sample, 
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Fig. 3.— The SDSS Quasar redshift-space 2PCF, £(s), from the 
UNIFORM sample (filled circles). The solid line shows the best 
fit single power-law model over 1 < s < 25.0 h' 1 Mpc, while the 
dotted line shows the best fit single power-law model over 1 < s < 
100.0 h~ 1 Mpc. The lower panel shows the £(s) behaviour near 
zero on a linear scale. The quoted errorbars are jackknife errors 
from the diagonal elements of the covariance matrix. 

This expression should be valid at smaller scales where 
the number of pairs is small and most pairs are indepen- 
dent (i.e. few quasars are involved in more than one pair; 
IShanks k Bmdel Il99l ICroom fc Shanks! Sa p). How- 
ever, as reported in iMvers et al.l (|2005h and IRoss et al.l 
(|2007f) . the Poisson error under-estimates measurement 
error when compared to e.g. the field-to-field or Jack- 
knife errors at larger scales, where quasar pairs are not 
independent. For this work, we will not report any field- 
to-field errors, but instead concentrate on a jackknife re- 
sampling procedure in order to calculate the full covari- 
ance matrix, from which we will use just the diagonal 
elements. Full details of the jackknife procedure, includ- 
ing the geometry of the subsamples used and the justifi- 
cation for using only the diagonal elements are given in 
Section 4 and Appendix B. 

4. RESULTS 

4.1. SDSS Quasar Redshift- Space Two-Point 
Correlation Function, £(s) (t).30 < z < 2.2) 

The two-point redshift-space correlation function for 
the SDSS DR5Q UNIFORM sample over the redshift in- 
terval 0.3 < z < 2.2 is given in Figure [3] As described in 
Appendix B, the errorbars are jackknife errors from the 
diagonal elements of the covariance matrix, i.e. af = Cu. 
We justify this approach by considering that the covari- 
ance matrix is close to diagonal (Fig. [TB)) and using just 
the diagonal elements of the covariance matrix produces 
results very close to that using the whole matrix, when 
fitting out to 25 h^ 1 Mpc. The off-diagonal elements of 
the covariance matrix are too noisy to be useful at large 
scales, and we therefore only use the diagonal elements 
in all the fits and plots that follow. 

We start by fitting a simple, single power-law model 
of the form in Equation [5J We find that a single 
power law with a redshift-space correlation length of 
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Fig. 4.— The Quasar redshift-space 2PCF, £(s), from the UNI- 
FORM sample as in Fig. [3] Also shown are the red shift-space 
correlation functions from the 2QZ (Croom ct al. 200g), shown as 
cerulean filled squares connected with a dotted line, and the 2SLAQ 
QSO survey (da Angela et al. 2008), shown as the red filled squares 
connected by the dashed line. There is excellent agreement between 
the three surveys. 

s = 5.95 ± 0.45 h' 1 Mpc and power-law slope of 
provides an adequate description of the 



= 1.16 



+0.11 
-0.08 



data over the scales 1.0 < s < 25.0 h^ 1 Mpc (solid line, 
Fig. [3]). Here a value of x 2 = 11.5 is obtained with 
11 degrees of freedom (dof) giving P, the probability 
of acceptance (of our power law model to the data) of 
0.402. A less suitable fit is found at larger scales due 
to the data falling below the power law. Over the range 
1.0 < s < 100.0 hr 1 Mpc, the best fit model has a similar 
correlation length, sq = 5.90±0.30 h^ 1 Mpc but a signif- 
icantly steeper power-law slope, 7 S = 1.57^qq5 (dotted 
line, Fig. [3]). The % 2 for this model is 32.8 with 15 dof 
and P = 5 x 10~ 3 . The data systematically deviate from 
the power-law fit, possibly due to the effects of redshift- 
distortions (on small scales), with a "flattening" of the 
data compared to the model at small, s < 5 h^ 1 Mpc, 
scales and a steepening at large, s > 40 h~ x Mpc, scales 
- though a decline below a power-law at large scales is 
also expected from linear theory via the CDM real-space 

Or). 

In Figure |4] we compare our results with the redshift- 
space correlation function £(s) from two other recent 
studie s, the 2QZ (|Croom et all [20051 ) and the 2SLAQ 
QSO (|da Angela et al.l [2008h surveys. The analysis by 
Ida Angela et all {2008) uses data from both the 2QZ and 
2SLAQ QSO surveys and thus the samples are not com- 
pletely independent. 

The 2QZ and 2SLAQ QSO surveys both cover very 
similar redshift ranges to our z < 2.2 sample. The 2QZ 
covers a much smaller area, rs 750deg 2 , than the SDSS 
but has 2/3 as many quasars as our sample, since it 
reaches to a deeper limiting magnitude of 6j = 20.85 
(corresponding to g « 20.80 and i « 20.42). The 2SLAQ 
QSO survey has a smaller area yet, ~ 180 deg 2 , and 
reaches a magnitude deeper than the 2QZ to g = 21.85 



(i w 21.45) resulting in 8 500 quasars with 0.3 < z < 2.2. 

The agreement in the correlation function between sur- 
veys over 1 hr 1 Mpc < s < 100 hr 1 Mpc scales is impres- 
sive but not necessarily unexpected, since we are essen- 
tially sampling the same type of objects i.e. luminous 
AGN, powered by supermassive black holes accreting 
at or near their Eddington limits (|Kollmeier et al.l 120061 : 
IShen et aT1l2008D . quite possibly in similar mass environ- 
ments (see Section 5). However, the samples have differ- 
ent luminosities, with mean Lb i.sdss — 3.4 x 10 46 erg 
s^ 1 (Tabled compared with mean £bo1,2QZ ~ 1-3 x 10 46 
erg s" 1 (assuming M^j = —24.6 and eqn. 27 from 
ICroom et aHl2005l for the 2QZ QSOs), suggesting that 
variation in quasar luminosity is due to a variation in 
SMBH fueling, rather than a variation in SMBH mass 
(which maybe correlated to halo mass). We explore this 
luminosity de pendence on clust ering further in the com- 
panion paper (|Shen et al.ll2009h . 

Figure [5] displays the very large scale £ (s) using the 
LS estimator. We see that apart from one data point at 
s « 400 h^ 1 Mpc, the redshift-space correlation func- 
tion is within la of £(s) = at scales greater than 
~ 300 hr 1 Mpc. A x 2 test comparing the data to 
£(s) = over the range of 100 < s < 1000 ft," 1 Mpc 
and 100 < s < 3000 hr 1 Mpc gives % 2 = 8.2 (18 dof, 
P = 0.975) and X 2 = 25.3 (54 dof, P = 0.999), respec- 
tively. Our rms scatter is ±0.001, which compares well 
to the 2QZ value of ±0.002; with a sample ~ 50% larger, 
we have roughly doubled the pair counts at these very 
large scales. The dimensions of our sample do not allow 
us to probe separations beyond 3000 hr 1 Mpc. 

4.2. SDSS Quasar 2-D 2-Point Correlation Function, 
£(r p ,7r) f0.30 < z < 2.2) 

Figure [6] shows the SDSS DR5 Quasar 2-D redshift- 
space correlation function £,{r p ,ir) for the UNIFORM 
sample, over 0.3 < z < 2.2. The redshift-space distor- 
tions in the clustering signal - seen as deviations from 
isotropy - are immediately apparent. At small r p , the 
random peculiar motions and redshift errors of quasars 
cause an elongation of the clustering signal along the linc- 
of-sight directio n, ir. This is the well-known "Fingers- 
of-God" effect ([Jackson! 119721 ) . Cosmological informa- 
tion can be extracted from the Quasar 2D £(r p , tt) mea- 
surement (e.g. iHovle et all 120021 : Ida Angela et al.l 120051 . 
2008). However, full treatment of the separation of the 
effects of large-scale 'squashing' in r p (used to determine 
P(z) in equation 1 1 2() and the substantial contribution 
from the Fingers-of-God at small scales is left to a fu- 
ture paper. 

4.3. SDSS Quasar Projected 2-Point Correlation 
Function 

In Figure [3 we show the projected 2-point correlation 
function, w p {r p ), calculated using equation [9] The re- 
ported error bars are jackknife errors, using the same 
jackknife area subsamples as for the £(s) calculation 
(Appendix B). Since we are fitting power laws of the 
form £(r) = (r/ro) 7 (equation IT0| . we plot w p (r p )/r p 
on the ordinate. We find the best fitting single power- 
law to the SDSS Quasar w p {r p )/r p data to be ro = 
5.45±g;|^ h- 1 Mpc and 7 = lJOj^os over our full range 
of scales, 0.1 < r p < 130.0 h^ 1 Mpc. This provides a 
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Fig. 5.— The SDSS Quasar redshift-space 2PCF, f(s), for our 
scales. Jackknife errors are plotted. The data are consistent with 
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Fig. 6. — The SDSS DR5 Quasar £(r p , w). The contours give lines 
of constant £ having <51og£ = 0.4 between contours, with log£ = 
1.6 the highest value at the centre of the plot. The thick contour is 
£ = 1.0. The actual f (r p , it) measurement is repeated and mirrored 
over four quadrants to show the deviations from circular symmetry. 

somewhat poor fit, giving a value of x 2 = 22.02 with 12 
degrees of freedom (P = 0.038). We remind the reader 
that due to fibre collisions, measurements at scales of 
r v S 1 ^ 1 Mpc are biased low (Sec. IC.6[) . Restrict- 
ing the range to 4.0 < r p < 130.0 h^ 1 Mpc, we find 
the best fit power-law has an increased real-space corre- 
lation length of r = 8.75+o;5o h^ 1 Mpc and a steeper 
slope of 7 = 2.40^q'5q- This power-law is a more accept- 
able fit, having X 2 = 3.47 with 6 dof (P = 0.748). We 
further suggest that the difference between the fitted re- 
sults and their dependence on scale is due to a "break" 
in the w p (r p )/r p measurements at r p ~ 2 — 5 hr x Mpc. 
However, we are hesitant to offer an explanation of this 
behaviour of our measurements in terms o f, e.g. the tran- 
sition from the 1 to 2-halo regime (cf. iPorciani et al.l 
12001 . 

Compari s ons o f our w p (r p )/r p results to those of 
IShen et all (|2007f ) for the z > 2.9 redshift quasar mea- 
surements shows that the high redshift SDSS quasars 




Fig. 7.— The SDSS Quasar redshift-space 2PCF, w p (r p ). The 
dashed line shows the best fit single power law to the data over our 
full range of scales, 0.1 < r p < 130.0 Mpc. Here, the real-space 



correlation length is ro = 5.45 



1.90 



+0.35 



hr 



Mpc with a slope 7 



+0.04 
0.03- 



Restricting the range to 4.0 < r p < 130.0 h 1 Mpc, the 



-,+0.07 



best-fit values become ro = 8.75^q' 50 h 1 Mpc and 7 = 2.40 
The lower panel shows the ratio of the data divided by the power- 
law model over 4.0 < r p < 130.0 ft -1 Mpc. 

have a much larger clustering amplitude than the lower 
redshift sa mple. The consequ ences of this are discussed 
in detail in lShen et all (j2007f ). 

4.4. Evolution of the SDSS Quasar Correlation 
Function 

Figures [5] and [5] present the evolution of the redshift- 
space, £(s), and the projected, w p (r p ), 2PCF, using the 
SDSS DR5 UNIFORM Quasar sample. 

We plot both £(s) and w p (r p ) for sub-samples of the 
UNIFORM data, with the relevant redshift limits given 
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Fig. 8. — The SDSS DR5 Quasar redshift-space 2PCF, £(s), and its evolution with redshift. All panels have the same scaling with the 
respective number of quasars, Nq, in each redshift range given. The thin (black) line in each panel is £(s) for the full DR5Q UNIFORM 
sample, over 0.30 < z < 2.20. The quoted errorbars are Poisson (see text for justification). 



in Table O Here we choose the redshift slic es so that we 
match those of the 2QZ Survey given by ICroom et al.l 
( 2005). Our survey generally has 50% more data in each 
redshift bin. However, since the 2QZ selects QSO candi- 
dates on the basis of their stellar appearance on photo- 
graphic plates, low-redshift quasars with detectable host 
galaxies on the plate are preferentially rejected from the 
final 2QZ catalogue, and the SDSS Quasar UNIFORM 
sample has a larger proportion of low, z < 0.5, redshift 
quasars 8 . We fit power-law models of the form given by 

8 The larger number of low redshift quasars in the SDSS sample 
is also at least in part due to the contribution of the Ha emis- 
sion line in the i-band, as well as host galaxy contribution at low 
redshift. 



equation @, over the ranges 1.0 < s < 25.0 h^ 1 Mpc 
(except for our 2.02 < z < 2.20 bin, where to get finite 
constraints, we fit to s max — 100 hr 1 Mpc). The best 
fit parameters and corresponding la errors are given in 
Tabled 

In Fig. [51 we show measurements for £(s) for the red- 
shift slices. The measurement of £(s) for the full red- 
shift range measurement is given by the thin line in 
each panel. We show Poisson errors as these are ap- 
proximately equal to jackknife errors on scales where the 
number of DD pairs is less than the number of quasars 
in the (sub)sample (see Fig. [T71 and Appendix [Bj) . This 
scale is s ~ 40 — 80 hr 1 Mpc for the sub-samples given 
here. As such, the errorbars on scales > 80 h~ x Mpc are 
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Fig. 9. — The SDSS DR5 Quasar projected 2PCF, w p (r p ), and its evolution with redshift. The solid line in each panel is w p (r p )/r p for 
the full DR5Q UNIFORM sample, over 0.30 < z < 2.20. The quoted errorbars are scaled jackknifes (see text for details). The relevant 
power-law fits as given in Table [2] are shown by the dotted lines. 



most likely under-representative. The £(s) data show a 
trend to 'lose' quasar-quasar DD pairs at the smallest 
separation, as the redshift increases. Keep in mind that 
the length scale suppressed due to the 55" fibre collision 
limitation increases from s ~ 0.2 h' 1 Mpc at z = 0.5 
to s ~ 1 ft -1 Mpc at z = 2 (Fig. [23]) . giving rise to the 
apparent depression in the correlation function on small 
scales. 

Fig.[S](tWp (r p )/r p ) has the same format as Fig.[5J How- 
ever, here we show scaled jackknife errors, scaled using 
the w p (r p )/r p Poisson and Jackknife error measurements 
from the full sample. As can be seen from inspection, the 
errorbars plotted here have generally larger magnitudes 
than the spread of the data alone. As such, this leads to 



questioning whether this is due to the Poisson errors be- 
ing general under-estimates or the jackknifes being over- 
estimates of the true error 9 . 

As a check, we calculate the "summed variance" Pois- 
son errors, that is, we sum the variances of each bin in- 
cluded in the integral for w p (r p ). This method returns 
smaller errors than those shown in Fig. [51 especially at 
the smaller, r p < 10 h" 1 Mpc, scales. Re-assuringly, 
when we come to fit single power-law models for £(r) 
(Sec. 5.2) in order to find values for the real-space 
correlation length, ro, the best-fit values we find from 

9 The interested reader is pointed towards recent work by 
INorberg etaLl J2008f l. who use large TV-body simulations to investi- 
gate different error estimators and the 2PCF for galaxy clustering. 
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Evolution of the real-space correlation length 
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Note. — Evolution of the redshift-space, so, and real-space, ro, correlation lengths. For so, both the correlation length 
and power-law slope were allowed to vary. All redshift-space subsamplcs were fitted over the range 1.0 < s < 25.0 h' 1 Mpc, 
unless otherwise noted with a , where the range was 1.0 < s < 100.0 Mpc. For so we quote values both with floating 
and fixed (73 = 1.16) power-laws. For the full sample, ro and 7 are allowed to vary and fits were performed over the scales 
1.0 < r p < 130.0 h~ 1 Mpc. While for the real-space subsamples, the calculation of ro was made by by fitting our wp(rp)/rp 
measurements using equation 1101 over the 1.0 < r p < 130.0 h —1 Mpc, w hile keeping the power-law index fixed at 7 = 2.0. 
The bolometric luminosities are from the catalogue of Shcn ct al. (2008). 



using the "summed variance" errors are in good agree- 
ment with those found using our w p (r p )/r p "averaging" 
method quoted in Table [2] We explicitly note though 
that there still could be an issue with the jackknife er- 
rors being too large (for w p (r p )/r p ) currently for reasons 
unknown. 

Figure [10] shows the evolution of the redshift-space cor- 
relation length, so, with both redshift and the age of the 
Universe (adopting the cosmology given at the end of 
Section 1). Since there is a covariance between the best- 
ht so an d 7s, here we fix 7^ to the best- fit value of the 
whole sample (7,5 = 1.16) and then measure the best-fit 
so- We find the clustering strength remains reasonably 
constant with redshift out to z ~ 3, which is equivalent to 
approximately 80% of the history of the Universe. This 
trend was also seen in Fig. [5] The correlation length is 
measured to be so = 5 — 7 h^ 1 Mpc for bright, optically 
identified quasars in the SDSS, up to z ~ 3. 

5. EVOLUTION OF GALAXY, AGN AND QUASAR 
CLUSTERING 

5.1. The Redshift- Space Evolution 

In Figure 1101 we compare our measurements of the 
evolution of the redshift-space correlation length, sq, to 
those recently published in the literature. We calculate 
our values for So by fitting our £(s) measurements using 
equation O Motivated by the fits in Fig. [31 we hold the 
power-law index fixed at 7 S = 1.16. The study of quasar 
clu stering most compar able to our own is that presented 
by ICroom et al.l (|2005l) for the 2QZ survey. Our study 
using the SDSS DR5Q UNIFORM quasar sample and 
the 2QZ are in very good agreement over the full red- 
shift range, given the associated uncertainties. However, 
in the SDSS DR5Q sample, we see very little, if any, 
evolution in the redshift-space correlation length even to 
z ~ 3, whereas the 2QZ does show marginal evolution 
in so. The similarity of these results again suggests that 
quasar clustering only weakly depends on luminosity for 



the dynamical rang es probed in thes e samples, a topic 
discussed further in IShen et alj ( 20091 ). 

The filled (red) star in Fig. [TO] is from the study by 
IWake et al.1 (|2004f ) who use a sample of 13 605 narrow- 
line AGNs in the redshift range .055 < z < 0.2 from th e 
first Data Release of the SDSS (jAbazaiian et all 120031) . 
They find that the AGN autocorrelation function is con- 
sistent with the observed galaxy autocorrelation function 
over s = 0.2 — 100 h^ 1 Mpc scales. Furthermore, they 
show that the AGN 2PCF is dependent on the lumi- 
nosity of the narrow [O III] emission line (£[0111] )> with 
low i[oni] AGNs having a higher clustering amplitude 
than high £[om] AGNs. This measurement suggests 
that lower activity AGNs reside in more massive DM 
haloes than do higher activity AGNs, as £[ omi provides 
a good indicator of AGN fu eling rate (e.g. iMiller et al.l 
l2003t [Kauffma nn et al.|[2~003l ). As such, it is interesting to 
note that our lowest redshift quasar clustering data point 
is, within t he uncertainties, c onsistent with the measure- 
ment from lWake et al.l (|2004j ) . We use the term 'quasar' 
here loosely, as for our lowest redshift bin, the mean bolo- 
metric luminosity is 1.6 x 10 45 ergs s _1 , a factor of 20 
lo wer than our full samp l e (Tab le [2|) . 

IConstantin fc Vogelevl (|2006[ ) study the clustering of 
specific classes of AGN, namely Seyert galaxies and 
LINERs (low-ionization nuclear emission-line regions) 
with the classes being separated on the basis of 
emiss i on-line diagnostic d iagrams (e.g. iBaldwin et al.l 
[19811: iKewlev et all l200l . They find that LINERs, 
which show the lowest luminosities and obscuration lev- 
els, exhibit strong clustering (s = 7.82±0.64 h~ x Mpc), 
suggesting that these objects reside in massive haloes 
and thus presumably have relatively massive black holes 
that are weakly active or inefficient in their accretion, 
potentially due to the insufficiency of their fuel sup- 
ply. Seyfert galaxies, however, have lower clustering, 
s = 5.67±0.62 h~ l Mpc (Fig.[TJJl blue triangle), are very 
luminous and show large emitting gas densities, suggest- 
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The (blue) filled triangle is the clustering measurement of Seyfert galaxies from Constantin & Vogclcv (2006). 



ing that their black holes are less massive but accrete 
quickly and efficiently enough to dominate the ioniza- 
tion. Therefore, based on our lowest redshift clustering 
results, the stronger link for our low-luminosity 'quasars' 
is to Seyfert galaxies rather than LINERs. 

5.2. The Real-Space Evolution 

In Figure [TT1 we compare our measurements (black cir- 
cles) of the evolution of the real-space correlation length, 
ro, to those recently published in the literature. We cal- 
culate our values for ro by fitting our w p (r p )/r p measure- 
ments using equation [TQl calculating an ro value at each 
separation where w p (r p )/r p is non-zero, and reporting 
the standard error on the mean for these values in Ta- 
ble [H Motivated by the fits in Fig. we hold the power- 
law index fixed at 7 = 2.0, thus setting ^(7 = 2) = tt 
( eqn. [T0|) . We caution again however, that as can be seen 
from inspecting Fig. [9j the scatter in the points is small 
compared to the quoted errorbars, and thus, this method 
may well under-estimate the errors associated with the 
r eal-space corr el ation length. 

iMvers et all (|2006l ) reported a measurement of 
the clustering of quasars using ~ 80 000 SDSS 



qu asars photomet r ically classified from the catalogue 
of iRichards et all (|2004T ). The ro measurements from 
IMvers et al.l (|2006h are given by the filled (red) squares 
in Fig. 111! and are in very good agreement with our own 
data (we plot the data from their Table 1, from the 'De- 
projected ro' section and the 0.75 < r < 89 h" 1 Mpc 
r ow). 

ICoil et all (|2007l ) calculate the cross-correlation be- 
tween ~ 30 000 redshift 0.7 < z < 1.4 galax ies observed 
as part of th e DEEP 2 galaxy redshift survey (|Davis et al.l 
[20011 12003T) . and quasars over the same redshift range. 
In total there are 36 SDSS quasars and 16 quasars iden- 
tified from the DEEP2 survey its e lf ove r the 3 deg 2 cov- 
ered by the DEEP2. ICoil et alj (|2007l) find that r ~ 
3.4±0.7 Mpc for the quasar-galaxy cross-correlation 
(£qg)- These authors measure ro ~ 3.1 ± 0.6 hr 1 Mpc 
for the inferred quasar clustering scale length, assuming 
that 7 is the same for the galaxy and the quasar samples 
and the two samples trace each other perfectly, giving 
£qg = \J^qq x £gg- We show this measurement as an 
open (purple) diamond in Fig. 111! Althoug h still consis- 
tent with the low-redshift measurement of IMvers et al.l 
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Fig. 11. — Evolution of the real-space correlation length, ro, up to redshift z = 2.9. The filled (black) ci rcles are from the D R5Q 
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(I2006I) . it is at odds with our measurements. Determi- 
nation of £qq from the cross-correlation measurement 
assumes that the density fields traced by the galaxies 
and quasars, Sg and 5q respectively, are perfectly corre- 
lated spatially, i.e. t he correlation coefficient between the 
two i s r = +1 (e.g. iBlanton et aT] [T999; Swanso n et al.l 
2008) 10 . Thus, as is quite plausible, if z ~ 1 quasars and 
galaxies sample the underlying mass density field differ- 
ently, then one can reconcile the difference in correlation 
lengths by invoking a correlation coefficient that is mod- 
e stly different from unit y. 

lAdelberger fc Steidell ( 2005a. b) studied the clustering 
of Lyman Break galaxies (LBGs) around 2 < z < 3 AGN. 
The dynamic range in luminosity for this sample is nearly 

10 The simplest and frequently assumed relationship between S\ 
and S2 is "deterministic linear bias", 5i = 6u n 52 where by ln is a 
constan t para m eter, <5i = pi(x)/pi — 1 and 82 = P2(x)/p2 — 1, e.g. 
IPeeblesI lTl980T ) : iDekel fc La hav (1999); Swanson ct al. (200l). 



10 ma gnitudes (16 < Gab < 26, lAdelberger fc Steidell 
2005b) and is thus much greater than for our SDSS DR5 
UNIFORM sample. These authors report a value of 
r = 5.27±i || h- 1 Mpc for a sample of 38 AGN with 
central SMBH masses of 10 5 8 < M BH /M < 10 8 and 
r Q = 5.20±^;f| h^ 1 Mpc for a sample of 41 AGN with 
10 s < Mbb/Mq < 10 10 - 5 . If we assume the correlation 
coefficient is r = 1 and the power-law slopes are con- 
stant between samples, we f ind (with t^lbg-l bg = 4.0± 
0.6 hr 1 Mpc at z = 2.9, lAdelberger et al.l 120051 ) that 
agn— agn ~ 6-9 bT 1 Mpc. This result is very broadly 
consistent with iMvers et al.l (|2006l) but in tension with 
our SP SS DR5 UNIFORM results. lAdelberger fc Steidell 
(|2005bf ) sample vastly different luminosity ranges than 
we do and find the clustering does not vary significantly 
with luminosity, immediately ruling out luminosity de- 
pendence as an explanation of the different clustering 
amplitudes. Again, the assumption of perfect correla- 
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tion is called into question, with a non-unity correlation 
coefficient r pot entially reconciling both these and the 
iCoil et alJ (I2007T) DEEP2 results. 

We also compare our results with clustering mea- 
surements ol recent deep X-ray surveys, which are 
particularly well suited to finding intrinsically less lu- 
minous, potentially obscu red objects at high redshift 
(|Brandt fc Hasingerl |2005l ). An immediate caveat we 
place in the following comparison is that the SDSS DR5Q 
surveys ~ 4000 deg 2 , while the largest solid angle of the 
current deep X-ray surveys is of order 1 deg 2 and there- 
fore the X-ray results are much more susceptible to cos- 
mic variance. 

iBasilakos et~all (j2004j) estimate ro using the angular 
autocorrelation function, w(0), of hard, (2-8 keV) X-ray 
selected sources detected in a w 2 deg 2 field using a shal- 
low (/x[2-8 keV] s=s 10~ 14 ergs cm~ 2 s" 1 ) and contiguous 
XMM-Newton survey. The area surveyed consisted of 13 
usable pointings, overlapping that of the 2QZ survey, and 
resulted in the detection of 171 sources. Various models 
for th e redshift distribution are given inTBasilako s et all 
(|2004 see their Table 1); for our comparison, we adopt 
the r values calculated using (0^,0^) = (0.3,0.7), 
which either assu me "pure luminosity evolution" (PLE, 
iBovle et al.lll998l) or "luminosity-d ependent density evo- 
lution" (LDDE. lUeda et al.ll2003T ). As such, the PLE 
and LDDE models produce different mean rcdshifts of 
z = 0.45 and z = 0.75 , respectively, for the AGN sample. 
IBasilakos et~aT1 (|2004h find r = 9.0±2.0 hr 1 Mpc for the 
PLE model and r = 13.5 ± 3 h,- 1 Mpc for the LDDE 
model, fixing the power-law slope at 7 = 2.2. These 
observations are given by filled (dark red) pentagons in 

F ig, an 

iGilli et ail (|2005l ) obtained a sample of nearly 260 
AGN in the Chandra Deep Field Nor th (CDF-N, 
lAlexander et all I2003L jgarger et all l2003h and South 
(CDF-S, iRosati et al.ll2002l ) with spectroscopic redshifts. 
They report that in both fields the AGN have z ~ 0.9 and 
a median 0.5-10 keV luminosity of Lx ~ 10 43 erg s _1 , 
i.e. in the local Seyfert galaxy luminosity regime. Cor- 
relation lengths and slopes of ro = 5.5 ± 0.6 h~ x Mpc, 
7 = 1.50 ± 0.12 and r = 10.3 ± 1.7 hr 1 Mpc, 7 = 
5.5 ± 6 are found for the CDF-N and CDF-S respec- 
tively (jGilli et all 12005. their Table 2), shown as filled 
( green) stars in Fig . [TT1 

iMivaii et al.l (|2007f) measured the angular autocorrela- 
tion function of X-ray point sources detec ted by XMM- 
Newt on in the - 2 deg 2 COSMOS field (|Scoville et all 
120071) . The measurements for the 0.5-2 (SFT), 2-4.5 
(MED) and 4.5-10 (UHD) keV bands are giv en by filled 
(green) triangles in Fig. [TTJ IGilli et ail (|2008f ) also report 
on the spatial clustering of AGN in the COSMOS field 
using ~ 550 spectroscopically identified AGN at a me- 
dian redshift and 0.5-10 keV luminosity of z — 0.98 and 
Lx = 6.3 x 10 43 erg s _1 respectively. They find a value 
of r = 8.65 ±0.5 h~ x Mpc (Fig. [Til open green triangle) 
and a power-law slope of 7 = 1.88 ± 0.07. However, this 
result is affected by a coherent structure of 40 AGN at 
z ~ 0.36. Removing this structure causes ro to drop to 
~ 6 h" 1 Mpc, similar to that of the previous deep X-ray 
AGN measurements. 

We find that our clustering measurements are in good 
agreement with the lower correlation lengths found by 
some of the deep X-ray surveys, e.g. IGilli et aTl (2005) 
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< z > 




l P {r,z) 


b 




1.27 


0.391 ±0.011 


0.069 


2.06 ±0.03 


0.24 
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0.176 
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0.49 
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0.104 
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0.049 
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0.674 ±0.166 


0.043 


3.62 ± 0.49 
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0.425 ± 0.442 


0.039 


2.99 ± 1.42 



Note. — The evolution of the linear bias for the SDSS 
Quasar UNIFORM sample. 

for the CDF-N. IMivaii et all (I20 07T) for their MED (2-4.5 
keV) band and XMM-COSMOS (jGjlli_e£in[2003) How- 
ever, there remains much scatter in the deep X-ray data, 
potentially due to cosmic variance and the small samples 
us ed for these analyses. T hus, we use the method given 
bv lSomerville et aTl (|2004l ) to estimate the "relative cos- 
mic variance", <r 2 = — rm*-^ T^y, where (N) is the 

mean and {N 2 } the variance of the probability distribu- 
tion function Pjv(V), which represents the probability of 
counting N objects in volume V. The second term here 
is the correction fo r Poisson shot no ise. For the XMM- 
COSMOS study bv lGilli et ail (|2008| ) we assume that the 
COSMOS area is 2 deg 2 and theref ore the volume is ~ a 
few xlO 7 h' 1 Mpc (from Fig. 1 in lScoville et al.ll2007f) . 
We also as s ume a number density of 1.8 x 10~ 4 ft. 3 Mpc -3 
IGilli et all (poM Fig. 9) for the COSMOS-XMM AGN, 
and a redshift of z = 1, even though the range is known 
to be much wider. This gives a bias of b ~ 2.2 according 
to Fig. 3 of ISomerville et al.l (|2004D . (Interestingly, this 
is very much in line with with what we find in the next 
section for the SDSS quasars at this redshift). Thus, the 
°« cosmos ~ 4 x 10~ 3 , which suggests that the COS- 
MOS survey shouldn't be dominated by cosmic variance. 
However, we note that this value does not take into ac- 
count the Poisson shot noise term, 1 / (TV) , which is likely 
to be significant considering the relatively small number 
(~500) of objects in the XMM-COSMOS sample. For 
the Chandra Deep Fields, the cosmic variance is much 
greater due to the fact that the areas are (at least) an 
order of magnitude smaller. (CDF-N is 0.13 deg 2 ; CDF- 
S is 0.04 deg 2 ). This could well explain the difference 
between the two CDF clustering measurements (as has 
been discussed in the relevant studies). We also note 
that t he AGN-LBG stud y comes from an area of ~ 0.5 
deg 2 (jSteidel et al.l 120041 ) and so cosmic variance could 
potentially be an explanation for the difference seen in 

Fig.rrjj 

5.3. Evolution of Bias 

One key reason for measuring the correlation function 
as a function of redshift, £(s, z), is to determine the lin- 
ear bias, b, defined by the model of equation JTJ. We 
shall assume that b is independent of scale on the scales 
and redshift range under investigation here. 11 We follow 

11 The precise way in which galaxies/luminous AGN trace 
the underlying matter distribution is still poorly understood. 
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Fig. 12. — Evolution of the linear bias of quasars, 6q, with redshift, to z = 3. The (bla ck) circles, are from the SDSS DR5Q UNIFORM 
sample (this work); the (red) squares, from the photometric SDSS quasar measurements (Myers ct al. 2006); the (green) squares from the 
2QZ survey (Croom ct al. 2005); the (black) stars are from the 2SLAQ QSO survey (da Angela ct al. 2008); The solid lines give dark halo 
masses from the models of Shcth ct al. (2001) with \ogh~ 1 Mq = 12.6, 12.3 and 11.7 from top to bottom. The dotted lines give dark halo 
masses from the models of Jing (1998) with \ogh~ 1 Mq = 12.3, 12.0 and 11.7 from top to bottom. 



the m ethod in lCroom et al.1 (|2005l ) and Ida Angela et aU 
( 2008) to determine b using our redshift-space correlation 
function £(s, z) measurements from Section 4. 

In order to minimize non-linear effects e.g. redshift- 
space distortions, we shall use the volume-averaged cor- 
relation function, £, defined as 



fir *™?ww 



(s 



3 

max 



(14) 



(15) 



in practice. Un- 
is always chosen 



where s m i„ = 1.0 h Mpc is set 
less explicitly stated otherwise, s m ax 
to be 20 hr 1 Mpc, so that non- linear effects in the 
sample should be small due to the s 2 we i ghting and 
for ease of comparison with iCroom et aU (|2005l ) and 



IBlanton et a l. (2006), Schulz & White] | |2006T) , ISmith et all (|2007|) 
and IColcs & Erdogdu (2007) all suggest that bias is potentially 
scale dependent. We do not take this into account in the current 
analysis. 



Ida Angela et all (|2008l ). In the linear regime, the z- 
space and real-space correlation functions can be given by 
equation (fTTj) . Thus we combine equations (fT|) and (TIT) , 
and recognise that (3 = f2 m 55 /6 leaves us with a quadratic 
equation in b. We are assuming a flat, cosmological- 
constan t model and hence the effective exponent of Q m 
is 0.55 (lLinddl200l IGuzzo et~ai1l2008f) rather than 0.6, 
suggested bv lPeeblesI ( 19801) . although we find this makes 
virtually no difference to our bias measurements. Solving 
the quadratic in b leads to 



b(z) 



l £ Q (s,z) ASl^jz) n^(z) 
£ p (r,z) 45 3 



(16) 



We now use our measured £q(s, z) together with a the- 
oretical estimate of £ p (r, z) and £l m (z) to determine the 

bias. 

To estimate £ p (r, z), we follow iMvers et all (|2007f) and 



Ida Angela et all fl 2Q08|), and use the non-linear estimate 
of P(k) giv e n by ISmith et all (|2003l ). The models of 
ISmith et all ((20031) predict the non-linear power spec- 
trum of dark matter for a range of CDM cosmologies over 
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a wide range of scale. We thus Fourier transform these 
P(k) models and integrate over s = 1 — 20 h^ 1 Mpc to 
compute £ p (r,z). The cosmological parameters used in 
our chosen model are £l m (z = 0) = 0.3, = 0) = 0.7, 

T = 0.17 and ct 8 = 0.84. We find the simple form, 



f p (r, z) = [Aexp(Bz) + C\£ p {r, z = 0) 



(17) 



where A = 0.2041, B = -1.082, and C = 0.018 models 
the evolution of £ p (r, z) extremely well, for 1 h^ 1 Mpc < 
s < 20 h' 1 Mpc. 

At the mean redshift of our survey, £l m {z = 1.27) = 
0.81, we find b Q (z = 1.27) = 2.06 ± 0.03 from the full 
SDSS DR5Q UNIFORM sample. The values for our red- 
shift sub-samples are shown as filled circles in Fig. [12] 
and are given in Table j31 We estimate our errors by 
using the variations in £(s) from our 21 jackknife esti- 
mates, scaled using the number of DD pairs in each red- 
shift slice subsample. Previous measurements from the 
2QZ Survey (filled green circles, ICroom et al.ll2005T ). the 

2SLA Q QSO Survey (open black stars, Ida Angela et al.l 
2008) and ph otometrically select ed SDSS quasars (filled 
red squares, IMvers et all l200l are again in excellent 
agreement with our data. We compare these bias esti- 
mates with various models in Section 5.4. 

Having measured b(z) and assuming a cosmological 
model, we can infer the parameter (3{z) using equa- 
tion 1121 The space density of quasars is much smaller 
than that of galaxies, so the errors on the clustering 
measurement (e.g. £(r p ,7r)) are much larger than for 



galaxy surveys (cf. lHawkins et al. | 120031 : IZehavi et al.l 
2Q0$M~ 



iRoss et aljf2007t iGuzzo et al.ll2008l ). Furthermore, 
as discussed in Section 4.2, we have not included the 
effects from the "Fingers-of-God" in the present cal- 
culation of j3(z) but the peculiar velocities at small 
(transverse r p ) scales will very stron gly affect the mea- 
sured redshift distortion value of j3 (jFisher et al.lll994l 
Ida Angela etall [2005h . With b{z = 1.27) = 2.06 ± 0.03 
and n m (z = 1.27) = 0.81 we find (3{z = 1.27) = 0.43, 
but for the reasons given above we present no formal er- 
ror bar. This result is consistent with the values of (i{z), 
measured from redshift-space distortions in the 2QZ sur- 
vey, (3{z = 1.4) = 0.45±Hi llOutram et al.ll200l and 
0(z = 1.4) = 0.50±g;^ (Ida Angela et al.lf2Q05h . 

5.4. Models of bias and dark matter halo mass 
estimation 

We now compare our bias measurements with those of 
recent models for the relationship of quasars to their host 
haloes. 

The fitting formula of Uingj (JI998), which is derived 
from iV-body simulations and assumes spherical collapse 
for the formation of haloes, is plotted in Fig. fTS] (dashed 
lines) with the assumed halo masses (top to bottom) 
Mdmh = 2.0 x lO^h-HlQ, 1.0 x 10 12 ft- 1 M a and 5.0 x 
10 11 /i- 1 M o , respectively. With the Uingj (|1998[ ) model, 
we find the halo mass at which a 'typical SDSS quasar' 
inhabits remains constant (given associated errors) with 
redshift, at a value of a M D mh ~ 1 X l0 12 h~ 1 M Q . 

By incorporating the effects of non-spher ical collapse 
for th e formation of dark matter haloes, ISheth et alJ 
(|2001h provide fitting functions for the halo bias, which 
are also shown in Fig. rT2] (solid lines). Here, the three 
assumed halo masses of (top to bottom) Mdmh = 4.0 x 



lO^h^MQ, 2.0 x IO^/i^Mq and 5.0 x 1O 11 /i" 1 M , 
respectively are pl otted. Comparing our results to the 
ISheth et al.l (|200lD models, we again find the host dark 
matter halo mass is constant with redshift, at a value of 
a Mdmh ~ 2 x 1O 12 /i _1 M0; this mass does not signifi- 
cantly change from z ~ 2.5 to the present day, i.e. over 
80% the assumed age of the Universe. Therefore, as dark 
matter halo masses generally grow with time, the ratio 
of the halo mass for a typical quasar to the mean halo 
mass at the same epoch drops as one approaches redshift 
z = 0. Since the "non-spherical collapse" model is likely 
to be more realistic, and fo r ease of compariso n with pre- 
vious results, we quote the ISheth et al.l (|2001h halo mass 
value from here on. 

Our values of halo masses of Mdmh ~2x 10 12 /i _1 Mq 
found for the SDSS quasars compare very well to those of 
iPadmanabhan et al.l ((2008) . wh o find a simi l ar valu e for 
low (z < 0.6) SDSS quasars. ICroom et all (|2005l ) also 
find a constant, but slightly higher value of Mdmh = 
3.0 ± 1.6 x W 12 h- 1 M Q , by using the ISheth etall (l200lh 
prescription, over the redshift range 0.3 < z < 2.9 
for the 2QZ. Ida Angela et al.l (|2008ft also find M D mh ~ 
3.0 x 10 1 h Mq but recall this analysis us es data from 
both the 2QZ and 2SLAQ QSO surveys. IMvers et aF 



provide halo masses (also using the ISheth et al 
(l200lh prescription) for two cosmologies and we take 
their T = 0.15, as — 0.8 model as this is closer to our 
own assumed cosmology. Again no e volution in the halo 
mass is found from z ~ 2.5, but the IMvers et all (|2007l ) 
value of Mdmh = (5.2± .6) x lO 12 ^" 1 ]^ is ap preciably 
higher than our results. iPorciani et al.l £2004) applying 
a halo occupation distribution (HOD) model to the 2QZ 
data, find Mdmh ~ 1 x 10 13 /i~ 1 M q . This is roughly an 
order of magnitude higher than the values we report and 
indeed at least double that of the other values found in 
the lit erature for luminous quasars. The IPorciani et alJ 
(2004) value is in line with M D mh ~ 1 - 2 X 10 13 /i _1 M Q 
which is the halo mass found for both the most lumi- 
nous quasars or those that are FIRST- detected (i.e. radi o 
loud) in the SDSS DR5Q at z < 2.5 (|Shen et al.ll2009l ). 
Thus we sugg e st som e caution should be taken in the 
IPorciani et al.l (|2004l ) result but note that these authors 
use the halo bias formula from ISheth fc Tormenl (|1999l ) 
which is likely to contribute to some of the discrepancy. 
iShen et a l. ( 2007) find a minimum halo mass of A/dmh = 
2-3xl0 12 h- 1 M e , and M DM h = 4-6 x IO^H^Mq, for 
the very luminous, higher clustered, high redshift SDSS 
quasars at 2.9 < z < 3.5 and z > 3.5 respectively. 

Using semi-analytic models for BH accretion and 
quasar e mission developed on t op of the Millennium Sim- 
ulation (|Springel et al.ll2005f ). iBonoli et all (|2008f ) pro- 
vide a direct theoretical companion work to our obser- 
vational study and that of Shen et al. (2009). These 
authors reproduce our findings that luminous AGNs i.e. 
the SDSS z < 2.2 quasars (with L^ D \ ~ L*), are hosted 
by dark matter haloes with a narrow mass r ange centred 
aroun d a few 10 12 ft- _1 M Q . The results of IBonoli et al.l 
(|2008l e.g. their Fig. 13) might however suggest a 
slightly stronger redshift evolution for the host halo mass 
at z < 2 than is given by our observational data, but this 
is hard to confirm given the associated errors on both the 

observational data and theoretical models ; 

We next compare with the models of I Hopkins et al.l 
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Fig. 13. — Evolution of the linear bias of quasars, 6q, with 
redshift to z = 6. Fil l ed (bl ack) circles, this work; filled (green) 
square s, ICroom et al.l ||2005|); open ( r ed) s quares, IMvers et alj 
p007h : filled (blue) circles. IShen et all $200%; The thin sol id line 
shows the behaviour for all of the three Hopkins ct al. ( 2007]) mod- 
els at i = 20.2, with these models having identical behaviour for 
6(2). The thick solid line shows the "Inefficient feedback" model 
for a magnitude limited survey of i = 30 i.e., a truly "complete" 
survey. The dotted line is for the "Efficient feedback" model (at 
i = 30) and the "Maximal growth" model is given by the dashed 
line (also for i = 30). 



(|2007l e.g. their Fig. 13). Here three models are de- 
scribed for the feeding of quasars. All the models have 
the same z < 2 behaviour, as in each case quasars are 
said to "shut down" , i.e. there is no accretion onto the 
central SMBH a t z < 2. 

The first of the lHopkins et all (|2007l ) models is the "In- 
efficient Feedback" model, whose predictions are given by 
the solid lines in Fig. 1131 Here z ~ 6 quasars grow ei- 
ther continuously or episodically with their host systems 
until the epoch where "downsizing" begins (i.e. z ~ 2), 
Thus, at redshifts z > 2 feedback from quasar activity 
is insufficient to completely shut down the quasar, hence 
the term "inefficient feedback" . 

The second of the I Hopkins et al.1 (j2007) models is the 
"Extreme Feedback" model, represented by the dotted 
line in Fig. [T2J Here each SMBH only experiences one 
episode of quasar activity, after which the quasar com- 
pletely shuts down, even if this occurs at high (z > 2) 
redshifts. BH growth will cease after this one-off quasar 
phase. If objects cannot grow after their quasar epoch 
even at high redshifts, then the subsequent decline of the 
break in the QLF at L = L* traces a decline in character- 
istic active masses, and the linear bias of active systems 
"turns over" . 

The third of the iHopkins et all (|2007f l models is the 
"Maximal Growth" model, represented by the dashed 
line in Fig. [13] In this model the BHs grow mass at their 
Eddington rate until z ~ 2. For example, a ~ 1O 8 M 
BH at z ~ 6 will grow to ~ 5 x 10 9 M Q at z ~ 2 at 
which point the growth ceases and the BH mass remains 
constant. 

The limiting factor in our ability to discriminate be- 
tween models is the dynamic range in luminosity and red- 
shift. We thus extend our redshift baseline up to z = 6 in 



Figure[l3]and now also plot the bias estimates for the z > 
2.9 SP SS quasar clustering measurements of IS hen et al.1 
given by the filled blue circles, where we use their 
measured values of £q(s,z — 3.2) = 1.23 ± 0.35 and 
(;q(s,z = 4.0) = 2.41 ± 0.59 with our equation |Q2J) to 
estimate the bias. 

A magnitude limit of m; < 20.2 is chosen for the mod- 
els to match the SDSS high-rcdshift quasar selection. As 
can be seen in Fig.[l3l all models match the observational 
clustering data well at z < 2. However, at this magnitude 
limit the QLF break lu minosity L* is only m arginally re- 
solved at z ~ 2 — 3 (e.g. lRichards et aT1l2006f k above this 
redshift surveys are systematically biased to more mas- 
sive L > L* BHs with higher clustering and larger linear 
biases. Subsequently, the models with the m t < 20.2 
limit have no discriminating power at z > 2, and the 
predicted behaviour for the linear bias from the "Inef- 
ficient" , "Efficient" and "Maximal Growth" models is 
identical. To break this degeneracy, deeper observational 
data at high redshift will be needed. Fortunately, these 
data should be in hand within the next few years, which 
will be able to discriminate and test these models, such 
as those with an effectively infinitely deep flux limit of 
i = 30 that are also plotted in Fig.[l3l Therefore, further 
investigations into the link between AGN/quasar activ- 
ity, the build-up of SMBH mass and the formation and 
evolution of quasars and galaxies using clustering mea- 
surements are left to future investigations. 

6. CONCLUSIONS 

We have calculated the two-point correlation function 
using a homogeneous sample of 30 239 quasars from the 
Fifth Data release of the SDSS Quasar Survey, cover- 
ing a solid angle of « 4000 deg 2 , a redshift range of 
0.3 < z < 2.2 and thus representing a measurement 
over the largest volume of the Universe ever sampled at 
25 h~ 3 Gpc 3 (comoving) assuming the current ACDM 
cosmology. We find that: 

• The two-point redshift-space correlation function is 
adequately described by a single power-law of the 
form£ = (s/s )" 7 where s = 5.95 ±0.45 h~ l Mpc 
and 7 S = 1.16±g;^ over 1 < s < 25 ft." 1 Mpc. 



• We see no evidence for significant clustering (£(s) > 
0) at scales of s > 100 hr Y Mpc. 

• There are strong redshift-space distortions present 
in the 2-D £(r p ,7r) measurement, with "Fingers of 
God" seen at small scales. However, these are most 
likely primarily dominated by redshift errors. 

• We find no significant evolution of clustering am- 
plitude of the SDSS quasars to z ~ 2.5, though we 
note that the luminosity threshold of the sample 
also increases steadily with redshift and the cluster- 
ing strength does increase at higher redshift. This 
is investigated further in Shen et al. (2009). 

• Comparing our results with recent deep X-ray sur- 
veys, our clustering measurements are in reason- 
able agreement in some cases e.g. Gilli et al. 
(2005), Miya.ji e t al. (2007) and XMM-COSMOS 
I Gilli et ail (|2008f ) but significantly lower correlation 



18 



N. P. Ross et al. 



lengths in others. However, there is still much scat- 
ter in the deep X-ray data, potentially due to cos- 
mic variance and the small samples used for these 
analyses. 

• The linear bias for SDSS quasars over the redshift 
range of 0.3 < z < 2.2 is b(z = 1.27) = 2.06 ± 0.03. 
Using this bias measurement and assuming £l m {z 

1 .27) = 0.81, but not taking into account the ef- 
fects of small-scale redshift-space distortions, we 
find (3{z = 1.27) = 0.43. Both these values are 
consistent with measurements from previous sur- 
veys, i.e. the 2QZ. 

• Using models which re late dark halo mas s to clus- 
tering strength (e.g. ISheth et all 120011 ). we find 
that the dark halo mass at which a 'typical SDSS 
quasar' resides remains roughly constant with red- 
shift at Mdmh ~ 2 x lO 12 ft -1 M . This non- 
evolution of quasar host halo mass agrees very wel l 
with previous studies by e.g. ICroom et al.1 ((20051 ) 
and Ida Angela et al.l (|2008l ). Therefore, as dark 
halo masses grow with time, the ratio of the typical 
halo mass for a quasar to other haloes at the same 
epoch drops with redshift. 

• Using current clustering data, we are unable to dis- 
criminate between the "Inefficient Feedback" , "Ef- 
ficient Feedb ack" and "Maximal Growth" models 
proposed by iHopkins et all (|2007l) at z < 2. The 
measured evolution of the clustering amplitude is in 
reasonable agreement with recent theoretical mod- 
els, although measurements to fainter limits will be 
needed to distinguish different scenarios for quasar 
feeding and black hole growth. 

Shen et al. (2009) study the clustering properties of 
DR5 quasars as a function of luminosity, virial mass, 
colour and radio loudness. 

The SDSS is now complete and the final quasar c at- 
alogue from Data Release 7 (jAbazaiian "etlj1l2"00l is 
being prepared. This catalogue should be a ~ 60% 
increment over DR5, containing about 130,000 quasars 
with spectroscopic observations, and will almost double 
the number of quasars in the UNIFORM sub-sample. 
DR7 will not change the luminosity dynamic range of the 
SDSS quasar survey but with fi nal analysis of data from, 
e.g., the 2SLAQ QSO Survey (jCroom et al.ll2008l ). and 
ext ension of the deep X -ray surveys (e.g. Extended CDF- 
S, iLehmer et al.l [20051 ). connections between the "lumi- 
nous" and "average" AGN luminosity regimes should be- 
gin to converge. 

Looking further ahead, even with the dramatic increase 
in data that surveys such as the 2QZ and SDSS have 
provided, the desire to increase dynamic range contin- 
ues. For instance, due to the steepness of the faint end 



of the quasar luminosity function (|Hopkins et al.l l2007). 
low luminosity quasars should be relatively plentiful, as 
long as one can identify these objects. This will be a 
strong challenge for the next generation of quasar red- 
shift surveys e.g. the Baryon Oscilla tion Spectroscopic 
Survey (BOSS, ISchlegel et all l2007h but one that will 
lead to another significant increase in our understanding 
of quasars, supermassive black holes, galaxy formation 
and evolution and the properties of the Universe. 
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APPENDIX 

A. SDSS TECHNICAL DETAILS 

A.l The Catalog Archive Server 

The SDSS database can be interrogated through the Catalog Archive Server 12 (CAS) using standard Structured 
Query Language (SQL) queries. When querying the CAS, one has a choice to query either the best or target database 

12 http://cas.sdss.org 
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Fig. 14. — Histogram showing the completeness of the DR5 Quasar survey by sector. The solid (orange) line shows completeness 
distribution for all 7814 sectors, while the dotted (blue) line shows the completeness distribution for the 5831 sectors which have one or 
more POAs in them. The summed area of sectors with given completenesses is shown by the dashed line. 



for a given Data Release (in our case, DR5). The former database contains information on all the photometric and 
spectroscopic objects obtained using the la test (and thus the "best") versions of the data reduction and analysis 
pipelines (Section 3, lAbazaiian et alj l2004f ). The target database however, contains the information on objects 
available at the time that the targeting algorithm pipelines were run. An object's magnitude or colour can be subtly 
different between target allocation and the most recent data processing, and some objects change their target selection 
status between the two. More detail s regarding the CAS, best and target are given in t he relevant SDSS Data 
Release papers (|Stoughton et al.ll2002t lAbazaiian et al.ll2004t lAdelman-McCarthv et al.ll2007f l. 

In order to create a statistical data sample, or to mimic it for a comparative 'random' sample, we need to know the 
properties of our chosen objects at the time of targetting, i.e. which objects were selected as quasar candidates. Thus 
here, we only use information from target . 

A. 2 SDSS Survey Geometry 

As mentioned in Section 3.1 and 3.2, in order to calculate the 2PCF, one needs to assemble a "random" sample 
which reproduces the angular selection function ("the mask") and the radial distribution of the quasar data. In this 
section we describe the steps required to define the mask over which our sample was defined. 

The first SQL query we run simply asks the CAS to return all the objects in the Photometric database that were 
targetted as "primary" candidate quasars. When run on DR5, this returns 203 185 objects from the PhotoObjAll 
table. 

We next calculate which primary 'PhotoObjAll' objects (POAs) fall within the spectroscopic survey plate boundaries. 
We do not use any of the "Extra" , "Special" , or "Ext raSpecial" plates for our analysis as these plates were not targetted 
with the normal quasar algorithm, or are duplicates (jAdelman-McCarthv et al1l2006[ ) . There are 145 524 POA objects 
that fall within 1.49 degrees of a given DR5 plate centre, noting that since plates overlap due to the tiling scheme, an 
object can be in more than one plate. 

Of these 145,524 objects, 11,336 are duplicate objects, defined as being within 1" of another object in the catalogue. 
Of these 134,188 unique objects, we would next like to know how many were (a) designated as spectroscopic ("tilable") 
targets by the process of 'Tiling' and (b) allocated fibres. A tile is a 1.49 degree radius circle on the sky which contains 
the locations of up to 592 tilable targets and other science targets (the other 48 fibres are assigned to calibration 
targets and blank sky). For each tile a physical aluminum plate is created. The plates will have holes drilled in them 
for fibres to b e plugg ed, in order to observe the tiled targets. The goal of the tiling procedure, described in detail by 
iBlanton et al.l (|2003f) . is to maximise the total number of targets assigned fibres. Due to the large-scale structure in 
the quasar/galaxy distribution th e proc e dure overlaps til e s with one other. 

As described in IBlanton et all (|2003l ). iTegmark et~aLl (|2004h . IBlanton et all (|2005f ) and iPercival et all (|2007f l. a 
"sector" is defined as a set of tile overlap regions (spherical polygons) observed by a unique combination of tiles and 
survey "chunks". A 'chunk' is a unit of SDSS imaging data and is a part of an SDSS 'stripe', which is a 2.5° wide 
cylindrical segment aligned along a great circle between the survey poles. These sectors are the natural areas on which 
to define the completeness of our sample. There are 7 814 sectors for DR5, 5 831 of which have one or more POA 
objects in them. Using the RegionID field in the target table (which gives the sector identification number if set, 



see also http://www.sdss.org/dr6/algorithms/tiling.html 
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Fig. 15. — Geometry of the SDSS DR5Q Jackknife areas, showing the location of the DR5Q PRIMARY (orange/grey dots) and the 
UNIFORM (black dots) samples. The jackknife areas were chosen to follow the overall geometry of the SDSS Quasar survey. The number 
of quasars in each area is approximately equal. Note the sparse coverage of the UNIFORM sample in the Southern Stripes. 



zero otherwise) we match the positions (R.A.'s and Decs) of objects in target to those that are in PhotoObjAll and 
the DR5Q. 

The efficiency of the quasar targetting algorithm is ~ 95% (Vanden Berk et al. 2005T ). We can define two functions for 
the primary sample which have dependence on angular position in the sky only in order to calculate the completeness 
of the survey: 

• Coverage Completeness, /q. The coverage completeness is the ratio of the number of quasar targets that are 
assigned a spectroscopic fibre to the total number of quasar candidates in a given sector. Fibre collisions will be 
one contributing factor in the coverage completeness. 

• Spectroscopic Completeness, f s (9)- This is the ratio of the number of high-quality spectra obtained in a sector 
to the number spectroscopically observed. Due to the nature of the SDSS quasar survey, this ratio tends to be 
very high. 

The 'overall completeness', /o, is defined as fo — fq x fs and the distribution of this overall sector completeness is 
shown in Fig. [T3J 

B. JACKKNIFE ERRORS 

Here we follow IScranton et all l|2002L §§3.4.5, 11.3 and their Eq. 10). lZehavi et alJ 1)20021 §3.4 and equation 7) and 
IMvers et al.l (120071. A ppendix Al to calculate the jackknife error estimates on our quasar clustering data. 
iMvers et all (120071) estimate errors using an "inverse variance" weighted jackknife technique. This method divides 



the data into N sub-samples and then recalculates the given statistic (e.g. £(s)) using the Landy-Szalay estimator 
(equation [5]), leaving out one sub-sample area at one time. Following their convention we denote subsamples by the 
subscript L and recalculate £(s)l in each jackknife realization via equation^ The inverse- variance- weighted covariance 
matrix, C(si, Sj) = Cij, is 



where £ denotes the correlation function for all data. Jackknife errors <Ji are obtained from the diagonal elements 
(af = Cu), and the normalized covariance matrix, also known as the regression matrix, is 

\C\ = Qi- (B2) 

0~i<Jj 

We divide the sample into 21 sub-samples. The number of subdivisions is chosen such that each represents a 
cosmologically significant volume, while retaining sufficient numbers of objects that shot noise will not dominate any 
subsequent analysis. The detailed boundaries of the sub -samples are given in Table [4] and described by Fig. [151 

We find, as in previous quasar clustering work (e.g. IShanks &: Boyld["l994t ICroom fc Shanks! [19961 ) . that Poisson 
errors are a good description on scales where DD q < N q , where N q is the number of quasars in a given sample and 
DD q is the number of quasar pairs in a given bin. On larger scales, the Poisson error tends to underestimate the 
Jackknife error, see Fig. 1171 The scale where N q w DD q is ~ 70 h' 1 Mpc for the SDSS UNIFORM Quasar sample. 
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Region 




RA min 




RA max 


Dec min 


Dec max 




No. of 




No. of 












Quasars 


Randoms 


N01 


120. 


140. 


-5. 


12. 


29 445 


870 558 


N02 


140. 


168. 


-5. 


18. 


28 456 


841 541 


N03 


168. 


196. 


-5. 


18. 


27 904 


825 442 


N04 


196. 


225. 


-5. 


18. 


28 717 


846 926 


N05 


225. 


256. 


-5. 


11. 


29 891 


879 837 


N06 


108. 


136. 


14. 


23.5 


29 614 


873 778 


N07 


108. 


136. 


23.5 


35. 


28 646 


845 871 


N08 


136. 


186. 


22. 


40. 


26 942 


798 307 


N09 


186. 


236. 


22. 


40. 


27 957 


820 491 
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Fig. 16. — The normalised covariance matrix, (the regression matrix) for from jackknife error analysis on 21 sub-samples of the 
UNIFORM DR5Q, for scales 1 h' 1 Mpc < s < 25 h' 1 Mpc. 

Given the smallness of the off-diagonal elements of the covariance matrix, we measure errors using the diagonal 
elements only. But here we carry out a check using the full covariance matrix. We fit the observed £(s) to the power 
law model using the full covariance matrix. We calculate \ 2 as 

X 2 = - Z™ d (si)}Cr/[Z( Sj ) - £ mod ( S ,)] (B3) 

ij 

where C^ 1 is the inverse covariance matrix, and £mod(s) = (s/so) _7s is °ur model, where we vary sq over the range 
so = 0.0 — 15.0 hr 1 Mpc in steps of 0.05 hr 1 Mpc and j s over the range j s = 0.00 — 3.00 in steps of 0.01, fitting £(s) 
on scales from 1 h^ 1 Mpc < s < 25.0 h^ 1 Mpc scales. 

Our estimates of the redshift-space correlation length and power- law slope are now s = 6.35+o;35 h" 1 Mpc and 
7 S = l.H^o.08 respectively (we found s = 5.95 ±0.45 hr 1 Mpc and "/ s = 1.16t^o8 using the diagonal elements only). 
However, fitting over 1.0 hT x Mpc < s < 100.0 hr 1 Mpc scales, we find there is some tension between the best- fit 
values given in Section 4.1 of s = 5.90 ±0.30 hr 1 Mpc and j s = 1.571^5 and thc best-fit values using thc covariance 
matrix of s = 6.95±gJ| /i" 1 Mpc and 7 S = 1.53 ± 0.09. We believe this is due to the noisy matrix inversion, where 
small values at large scales in the covariance matrix will dominate the signal in the inverse matrix. However, we 
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Fig. 17. — Comparison of Poisson and Jackknife errors for the UNIFORM DR5 Quasar sample. The ratio between the Poisson and 
Jackknife errors (from the diagonal elements of the covariance matrix only) is very close to one at s < 70 h~ 1 Mpc, while at s > 70 h~ 1 Mpc, 
the Poisson errors are ~ double that of the Jackknifes. 
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Fig. 18.— The SDSS DR5 Quasar £(s) for the PRIMARY and UNIFORM samples. The lower panel shows the behaviour of £(s) near 
zero on a linear scale. There is excellent agreement between the two samples at s < 20 /i -1 Mpc, but the PRIMARY sample exhibits a 
higher clustering strength at large scales, s > 40 h~ J Mpc. This result provides our main motivation for using the UNIFORM sample 
exclusively in sections 3 and 4. 

are confident that using the diagonal elements of the covariance matrix only for our model fits does not change the 
interpretation of our results. 

C. SYSTEM ATICS IN THE SDSS QUASAR 2PCF. 

Here we explore the effects of how the various systematic effects in our data, and our methodology affect our 
corre latio n results. We shall determine the effects o f diff erent quasar samples fSec. IC.il) , changing the high-redshi ft cu t 
(Sec. IC.2"|) . the fields which had poor imaging (Sec. IC.4"|) . Galactic reddening (Sec. IC.5j) and fibre collisions (Sec. IC.6"|) . 
We shall report on £(s) and w p (r p )/r p and find that when using the UNIFORM sample, our overall results (and 
subsequent interpretations) are robust. 

C.l Effects of Different Samples on £(s) 

Figure ITS1 shows the difference in the redshift-space correlation function, £(s), for the PRIMARY sample and of the 
UNIFORM sample using the LS estimator. The two samples are in excellent agreement at small scales, s < 20 h~ x Mpc, 
but the PRIMARY sample exhibits a higher clustering strength at large scales, s > 40 ft. -1 Mpc. One possible 
explanation for this discrepancy is due to the differing radial distributions in PRIMARY and UNIFORM resulting 
from the different target selection used before DR2. This result provides our main motivation for using the UNIFORM 
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Fig. 20.— The SDSS DR5 Quasar for the UNIFORM sample with the sample split into the NCG and SCO. The two measurements 
are in good agreement. 

sample exclusively in sections 3 and 4. 

C.2 High redshift cut-off 

Figure fT9l shows the redshift-space 2-point correlation function £(s) for the UNIFORM sample with the high-redshift 
cut-off being changed from z < 2.2 to z < 2.9. It is reassuring that the change between £(s) is minimal, though this 
is somewhat unexpected since our the optical selection for the quasar sample is known to be affected between z = 2.1 
and z = 2.9 (|Richards et al.ll2006f ). 

C.3 The NGC vs. the SGC 

Figure [20l shows the redshift-space 2-point correlation function £(s) for the UNIFORM sample, split into quasars 
from the North Galactic Cap (NGC) and the South Galactic Cap (SGC). Note the data is heavily dominated by the 
NGC in the UNIFORM sample. There is no detectable signal in the SGC £(s) below s w 10 h~ x Mpc and the two 
measurements are in good agreement. 

C4 Bad Fields 

In the SDSS, a "field" is an image in all five bands, with approximate dimensions of 13' x 10'. Since the quasar 
target selection algorithm searches for outliers from the stellar locus in colour space it is very sensitive to data with 
large photometric err ors due to problems in photometric calibration or i n point-spread f unctio n (P SF) determinatio n 
([Richards et al.ll2006ft . Thus, using the definitions of "ba d fields" given bvlRichards et all ()2006l ) and lShen etaD (|2007f ). 
based on the position of the stars in colour-colour space (jlvezic et alj|2004h . we calculate the correlation function both 
including and excluding data from these areas. 
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Fig. 22. — The SDSS DR5 Quasar £(s) for the UNIFORM sample with the sample split by E(B - V). We see no systematic difference 
in the clustering signal between the two £(s) measurements that might have been caused due to errors in the reddening correction model. 

Figure I2T1 shows the redshift-space 2-point correlation function £(s) for the UNIFORM sample (solid black circles). 
Also shown is £(s) for the PRIMARY sample including, (solid green) and excluding, (dashed red) lines, the "Bad 
Fields" as defined by IS hen et al.l (|2007f ). Here we can see that there is minimal difference (for the PRIMARY DR5Q 
sample) between the £(s) estimates when including and excluding the bad fields. This results is reassuring but 
generally expected since at z < 2.2 quasar selec t ion u sing the UV excess technique is relatively insensitive to "bad 
fields" . However, at higher redshift, IShen et all (|2007l ) found this to be a major issue, where the selection is more 
sensitive. 

C.5 Reddening 

While all selection for the quasar sample is und ertaken using dereddened colors ((Richards et al.| [2002) following the 
Galactic extinction model of Schlcgcl ~et alj (|1998[ ). any remaining systematic errors in the reddening model can induce 
excess power into the clustering in a number of different ways. The most obvious possibility comes from a modulation 
in the angular density of quasars as a function of position on the sky. In addition the color dependence of the reddening 
correction may preferentially exclude quasars at specific redshifts. As we currently assume a common N(z) for all 
quasars in the UNIFORM sample, an N(z) that is reddening-dependent can also induce excess clustering. For this 
analysis we will assume that any artificial signal that might be induced by the reddening correction will scale with the 
magnitude of the reddening correction itself. We therefore subdivide the UNIFORM quasar sample into two subsets, 
of approximately equal number, a low reddening sample, with 0.0028 < E(B — V) < 0.021 7, and a high redden ing 
sample 0.0217 < E(B — V) < 0.2603. The reddening estimates are derived from the maps of ISchlegel et all (|1998l ). 

Figurel2"2"lshows the redshift-space 2-point correlation function £(s) for the full UNIFORM sample, with the reddening 
split sub-samples. The low reddening component, dot-dashed (green) line and the high reddening sample, dotted (red) 
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Fig. 24. — The SDSS DR5 Quasar f(s) for the UNIFORM sample: with no fibre collision correction (filled circles); using photometric 
redshifts for quasar candidates that we not observed due to fibre collisions (green, dot-dashed line) and using the rcdshifts from the nearest 
observed quasar for quasar candidates that we not observed due to fibre collisions (red, dotted line). We see very little difference on scales 
s > 5 h~ 1 Mpc but do measure increased values of £(s) at ~ 1 — 5 h" 1 Mpc. 

line are consistent within the errors for all scales out to ~ 250 hr 1 Mpc. There is no evidence for a systematic 
difference in the clustering signal on large scales that might be induced by any modulation due to errors in the 
reddening correction. 

C.6 Fibre Collisions 

Due to the design of the SDSS fibres and plates, no two spectroscopic fibres can be separated by less than 55" 
(Section 2.1). The corresponding minimum physical separation in r p that we can sample is shown by Fig. 1231 

To investigate this effect on our correlation function estimates, we find which of the 145 524 POA objects were not 
observed due to fibre collisions. There are 431 objects that were within 55" of a UNIFORM quasar that were not 
observed. We assign the "collided" quasar candidates a redshift using two met hods. First, (model (1) in Fig. l24l 
and Fig. [25]), using the new version of the SDSS Quasar photometric catalogue, (Richar ds et al.ll2009t ). we assign the 
redshift of the nearest photometric quasar to the collided objects. We assume that all the collided objects are in fact 
quasars, though in reality this is not the case. Second, (model (2) in Fig. [Ml and Fig. |2"B"|) we assign the collided quasar 
candidate the redshift from the quasar that "knocked it out". We then recalculate the 2PCF with these additional 
objects. 

As we can see from Fig. [24j the inclusion of these collided objects makes very little difference to our measurement of 
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Fig. 25. — The projected correlation function w p (r p )/r p for the SDSS DR5Q UNIFORM sample with 0.30 < z < 2.2 with no fibre 
collision correction (filled circles); using photometric redshifts for quasar candidates that we not observed due to fibre collisions (green, 
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Fig. 26. — The projected correlation function w p (r p )/r p for the SDSS DR5Q UNIFORM sample with 0.30 < z < 2.2, varying 7r ma x from 
equation[9] The lower panel has the data divided by the best fit power law from Section 4, with rg = 8.75 h~ 1 Mpc and 7 = 2.40. 



£ (s) at scales > 5 h 1 Mpc. However, we do measure increased values of £(s) at s = 1 — 5 h 1 Mpc. Therefore, we again 
fit a single power-law to the data which has been corrected for fibre collisions using the photometric quasar redshifts, 
over the scales 1 < s < 25 h" 1 Mpc and find s = 6.70±g;|jj h' 1 Mpc and 7^ = 1.29lg;^ (cf. s = 5.95 ±0.45 h~ l Mpc 
and 7 S = 1.16^q 08 found in Section 4.1). With the inclusion of more data at small separations, the fibre-corrected 
£(s) has a higher Sq value and steeper slope, but we find these results are consistent with our measurement of £(s) 
without the fibre collision corrections, given the errors. 

By examining Fig. [23 we see that fibre collisions do not account for the possible break in the slope of w p (r p )/r p that 
was discussed in S ection 4.3. We are thu s sat isfied that fibre coll isions do not impact the results presented herein and 
refer the reader to iHennawi et al.l (|2006f ) and iMvers et alj (|2008f ) for more detailed investigations of quasar clustering 
and quasar binaries on these very small scales. 

C. 7 Varying 7r ma x limits for w p (r p ) 
Figure [26] shows the projected correlation function w p (r p )/r p for the SDSS DR5Q UNIFORM sample with 0.30 < 
z < 2.2, varying 7r max from equation [9[ We vary 7r max in intervals of 10 2 over the range 7r max = 10 1 - 4-2 = 
25.1 — 100.0 Mpc. Although changing the 7r max cut does produce a noticeable effect in estimates of w p (r p )/r p , 
when fitting our single power law over the scales 4.0 < r p < 130.0 hr 1 Mpc, we do not see a significant change in the 
best-fit ro or power-law slope values, with the former constant at ro « 8.3 hr 1 Mpc and the latter constant at 7 rs 2.3. 
We are therefore confident that the integration limit of 7r max = 63.1 hr 1 Mpc provides a good balance between larger 
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7r values which would add noise to our w p (r p )/r p estimate, and lower ir values which might not recover the full signal 
at the largest separations. 
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